CDS
Accession Number | TCMCG075C17994 |
gbkey | CDS |
Protein Id | XP_017976874.1 |
Location | join(37319840..37319959,37321066..37321167,37321362..37321505,37321610..37321690,37321781..37321850,37322099..37322169,37322245..37322408,37322502..37322573,37322665..37322902,37323135..37323197,37323287..37323571,37323674..37323787,37323890..37324016,37324106..37324275,37324367..37324468,37325408..37325511,37325597..37325888,37326040..37326284,37326857..37327360,37327476..37327770,37327874..37328057,37328267..37328355) |
Gene | LOC18600278 |
GeneID | 18600278 |
Organism | Theobroma cacao |
Protein
Length | 1211aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018121385.1 |
Definition | PREDICTED: protein HASTY 1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAAGAAGGCAACAGTAACGACAGCAAAGTAAACAATGTGGCTAGAGCCATTGTTGCAGCCCTTGATTGGAACTCTACTCCTGATGCTCGCAAAGCTGCCGTGTCTTACCTTGAATCCATCAAAGCAGGAGATATACGAATTTTGGCAAACACATCATTCCTTTTAGTCAAAAAAAATTGGTCTTCAGAAATTCGGTTACATGCATTTAAAATGCTACAGCACTTAGTTCGGTTGCGGTGGGAGGAATTTGGTCCTTTAGAACGTAAGAACTTTGCGAATGTTGCTGTTGAGTTAATGTCTGAAATTGCAGATCCTTGTGAGGAATGGGCTTTGAAAAGTCAAACAGCCGCCCTTGTCGCTGAGATGGTTAGAAGAGAAGGACTAAATCTATGGCAAGAGCTGCTTCCCTCTCTAGTTTCCCTATCTAGCAAGGGTCCTGTACAAGCTGAGTTGGTCTCAATGATGCTAAGATGGCTTCCTGAAGATATTACTGTGCACAACGAAGATTTGGAAGGTGATCGACGTAGATTACTGTTACGTGGGCTTACTCAATCTTTGCCAGAAATTTTGCCACTACTATACACATTATTGGAAAGGCATTTTGGAGCTGTATTAAGTGAGGTGAGTAGGCAACAACTTGACATTGCAAAACAGCATGCAGCTGCTGTAACAGCTACTCTAAATGCTGTCAATGCCTATGCCGAATGGGCTCCTTTGCCTGATCTTGCTAAATACGGCATCATTCATGGGTGTGGTTTCCTACTGTCGTCTCCTGATTTTCGTCTTCATGCTTGTGAGTTTTTCAAACTTGTCTCTCCAAGAAAGAGACCTGCTGATGATGCTGCTTCTGAATTTGATTCTGCAATGAATAGCATCTTTCAGATCTTGATGAATGTATCCAGAGAATTTTTAGTCAGATCTAGCTCTACAGGTGGGGCTATAGATGAAAGTGACTGTGAATTTGCAGAATATGTATGTGAAAGCATGGTGTCTTTGGGTTCCTCAAACTTGCAATGTATTGTCGGAGATAGCACTACACTCTCTCTTTATTTACTACAGATGCTGGGGTTCTTTCAACATTTTAAGCTGGCTCTTCATTATCAATCCCTGCAATTTTGGTTGGCACTAATGAGGGATTTGATGTCAAAGCCAAAGCTTCATTCAGCTGGAGATGGTTCAGCTGTTACCAATGTGGATTCTACTTCAGCGCAGGTTGATAATGAAAAGAGAAAGATTTTAAGTTTTCTGAATGATGATATTTGTAGTGCAATTCTGGATATATCTTTCCAACGCATGCTTAAAAAAGAAAAGCTTATGACTGGAACAGCCCTCTCTCTGGGGGTTTTGGAGTTGTGGAGTGACGATTTTGAGGGCAAGGGTGATTTTGGCCAGTACCGTTCTAGGCTGCTTGACTTAATCAAGTTTATTGCTTCAAACAAGCCCCTTGTGGCTGGTGCTAAAATTTCTGAAAGAATTATTATGATCATTAAGAACCTCTTGAATTCACCAATGCCTGCTCAGGTCTTAGTTGTGATGGAAAGCATGCAAGTGGCTCTGGAAAATGTTGTTAGCTCTATATTTGATGGATCAAATGAGTTTGCCGGTGGTGGTTCAGAAGTTCACCTGGCTTTGTGTAGAATATTTGAAGGTTTACTTCGGGAACTTCTTTCTTTAAATTGGACTGAGCCTGCCCTTGTGGAAGTACTTGGACGCTATCTAGATGCAATGGGTCCCTTCCTCAAGTATTTCCCAGATGCAGTTGGCAGTGTCATCAATAAGCTTTTTGAGCTCCTAAATTCACTTCCTTTCGTTGTTAAGGATCCTTCAACAAGCAGTGCACGACATGCAAGGTTGCAGATTTGTACATCATTTATTCGAATGGCCAAAGCTGCTGACAAAAGTATTCTGCCTCACATGAAGGGTATTGCTGATACAATGGCATATTTACGAAGAGAAGGTTGTTTGCTTCGTGGTGAACATAATCTTCTAGGTGAAGCGTTTCTTGTCATGGCTTCTGCTGCTGGGATTCAACAACAGCAAGAAGTTTTGGCATGGTTACTTGAACCCTTGAGCCAACAGTGGATACCAATAGAGTGGCAAAACAATTACCTGTCTGAACCGCTTGGTCTAGTTCGTTTATGCTCCGATACAGCATTTATGTGGTCACTTTTCCACACTGTGACATTTTTTGAGAAAGCACTTAAGAGGAGTGGAATGAGGAAAGGCAATTTGAACTTACAAAACAGCTCAACAGCAAGTTCTACCCCACATCCAATAGCTGCTCATCTGTCTTGGATGCTGCCTCCTCTCTTAACACTGCTTCGTGCTATACATTCCCTTTGGTCACCATCCATATTCCAAACATTACCTGGAGAAATTAAAGCAGCAATGAGCATGAGTGATGTGGAGCGGTCCAGTCTTCTTGGCGGTGGGAACCCCAAATTGTCAAAGGGTGCATTAACTTTCATAGATGGATCTCAGTTTGATGTGAATAAGGAAGGATATACAGAACCAAATGAAGCTGACATACGAAATTGGTTAAAGGGTATCAGAGACAGTGGGTACAATGTATTGGGCCTATCTACTACCATTGGAGATCCATTTTTTCAATTCATGGACATTGATTCTGTTGCTTTAGCTCTAATTGAGAATATACAATCAATGGAGTTCAGACATACAAGGCAGCTTGTTCATTCTATTTTAATTCCTCTGGTTAAGTCCTGTCCTCCAGATATGTGGGAGGTCTGGCTGGAAAAGCTACTGCACCCGTTATTTGTCCACTGTCAGCGAGCTCTTAGTTGTTCATGGTCCAGTCTTTTGCATGAAGGCCGGGCGAAGGTTCCCGATAATCATGGTATTCTCACTGGGTCAGACTTGAAAGTGGAAGTAATGGAGGAAAAATTGCTTCGAGATTTAACTCGTGAGATATGTTTGCTCCTCTCTACTATGGCATCACCTGGATTAAATGCTACCCTTCCTAATTTAGAACATTCTGGGCATTTTGGCCGTGTGGACATGTCTTCTCTGAAAGATTTGGATGCATTTGCATCAAGCTCTATGGTTGGTTTCCTTTTGAAGCACAAAAGCCTGGCAATTCCAGTATTGCAGATTTCATTAGAAGCATTCACTTGGACAGACAGTGAAGCTGTGACCAAAGTTTGTTCCTTTTCTGCTGCTGTGGTTCTTCTAGCTATATTTACAAACAATGTGGAAATCCAGGAATTTGTTTCTAGAGATCTATTTTCTGCAGTCATCCGAGGTTTGGCCCTTGAGTCAAATGCTGTTATCAGTGCTGATCTGGTTAATCTCTGTCGCGAAATATTCATATATCTCTGTGACAGAGATACGGCTCCCAGGCAGATTTTACTCTCCCTTCCTTCTATTAGCCCCAACGATTTACATGCCTTTGAAGAAGCCTTGGCAAAGACCGCTAGTCCTAAAGAACAAAAGCAGCATATGAGGAGCTTGCTTTTATTAGCTAGTGGGAACAACTTAAAAGCTCTTGCTGCTCAGAAAAGTGTAAACATTATAACAAATGTTACAACGAGGCCTCGCGGTTCAGTCAATGTGCCTGAAAATAGAATCGATGAGGGGGACACCAATCACACCATAGGCTTGGCAGCAATTTTGTGA |
Protein: MEEGNSNDSKVNNVARAIVAALDWNSTPDARKAAVSYLESIKAGDIRILANTSFLLVKKNWSSEIRLHAFKMLQHLVRLRWEEFGPLERKNFANVAVELMSEIADPCEEWALKSQTAALVAEMVRREGLNLWQELLPSLVSLSSKGPVQAELVSMMLRWLPEDITVHNEDLEGDRRRLLLRGLTQSLPEILPLLYTLLERHFGAVLSEVSRQQLDIAKQHAAAVTATLNAVNAYAEWAPLPDLAKYGIIHGCGFLLSSPDFRLHACEFFKLVSPRKRPADDAASEFDSAMNSIFQILMNVSREFLVRSSSTGGAIDESDCEFAEYVCESMVSLGSSNLQCIVGDSTTLSLYLLQMLGFFQHFKLALHYQSLQFWLALMRDLMSKPKLHSAGDGSAVTNVDSTSAQVDNEKRKILSFLNDDICSAILDISFQRMLKKEKLMTGTALSLGVLELWSDDFEGKGDFGQYRSRLLDLIKFIASNKPLVAGAKISERIIMIIKNLLNSPMPAQVLVVMESMQVALENVVSSIFDGSNEFAGGGSEVHLALCRIFEGLLRELLSLNWTEPALVEVLGRYLDAMGPFLKYFPDAVGSVINKLFELLNSLPFVVKDPSTSSARHARLQICTSFIRMAKAADKSILPHMKGIADTMAYLRREGCLLRGEHNLLGEAFLVMASAAGIQQQQEVLAWLLEPLSQQWIPIEWQNNYLSEPLGLVRLCSDTAFMWSLFHTVTFFEKALKRSGMRKGNLNLQNSSTASSTPHPIAAHLSWMLPPLLTLLRAIHSLWSPSIFQTLPGEIKAAMSMSDVERSSLLGGGNPKLSKGALTFIDGSQFDVNKEGYTEPNEADIRNWLKGIRDSGYNVLGLSTTIGDPFFQFMDIDSVALALIENIQSMEFRHTRQLVHSILIPLVKSCPPDMWEVWLEKLLHPLFVHCQRALSCSWSSLLHEGRAKVPDNHGILTGSDLKVEVMEEKLLRDLTREICLLLSTMASPGLNATLPNLEHSGHFGRVDMSSLKDLDAFASSSMVGFLLKHKSLAIPVLQISLEAFTWTDSEAVTKVCSFSAAVVLLAIFTNNVEIQEFVSRDLFSAVIRGLALESNAVISADLVNLCREIFIYLCDRDTAPRQILLSLPSISPNDLHAFEEALAKTASPKEQKQHMRSLLLLASGNNLKALAAQKSVNIITNVTTRPRGSVNVPENRIDEGDTNHTIGLAAIL |